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METHOD AND DEVICES FOR PRIORITIZING ELECTRONIC MESSAGES 



FIELD OF THE INVENTION 

The present invention relates to a system and a method for reducing miproductive 
electronic messages trafl&c. More specifically, the invention relates to a system and 
a method to improve the efi&ciency of electronic messages usage. 

5 BACKGROUND OF THE INVENTION 

Electronic mail (e-mail) is an indispensable tool for both internal and 
external communication in most organizations. Unfortunately, the unrestricted use 
of electronic mail, without rules or limitations, has caused e-mail traflSc load to 
explode to the point where it is becoming a real threat to the infrastructure and to 

10 the workforce productivity of many organizations. 

Many employees receive literally tens or even hundreds of e-mails daily, a 
great deal of which are unimportant or irrelevant, forcing them to spend a major 
part of their workday reviewing e-mails, and leaving them short and shorter of time 
for other, more important tasks. Moreover, too often important messages get buried 

15 in the e-mail barrage, and as a result, the reader fails to respond or act upon them. 

Many techniques have been proposed for filtering and categorizing e-mail 
messages. Some of them are simple rule based methods that take simple actions 
such as automatic response, relying on parameters such as the sender's identity and 
a basic key word search (as described, for example, in US Patent No. 5,555,346). 

20 Others recommend more complex automatic actions, such as scheduling an 
appointment (as described in US Patent No. 6,553,358) or redirecting the message 
to an alternative communication device (e.g. US Patent No. 6,499,021), Many prior 
art systems perform an analysis of the message text, using machine-learning 
algorithms, and assign it to one of a predetermined set of categories. Some prior art 

1 
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techniques take assistance firom additional data such as the sender's identity being a 
member of the family or a work associate etc. 

A drawback of known techniques is their poor ability to predict the 
importance of received e-mails to the recipient and what is the right order to 
5 process the massive amount of messages in the user's inbox. Moreover, for 
employees in medium and large organizations, the vast majority of incoming 
messages originate from within the organization, and the importance of those is 
very difficult to determine based solely on their content For example, a device 
failure in a teleconomunications service provider's installation discovered by a field 

10 engineer, is reported by e-mail to a long distribution Ust comprisiag of development 
personnel, customer support personnel, regional sales people and management. The 
original e-mail message triggers a long chain of messages that may include 
questions, answers, comments and personal opinions relating to the original 
message. These messages reach a broad distribution list using the famous '^reply 

15 air* feature. Many recipients gradually lose interest in the discussion, but they keep 
beiag copied nevertheless. The e-mails in the chain are very hard to distinguish by 
analyzing their content, since they all include the original problem description, and 
they all iuclude relevant technical terms. On the other hand, their subjective 
importance and relevance for different recipients may vary significanfly. 

20 SUMMARY OF THE INVENTION 

It is an objective of the invention to provide a method and system that allow 
incoming messages to be pre-sorted or pre-tagged based on their relative 
importance as defined by pre-defined criteria, so as to allow the recipient to attend 
first to liiose received messages that are likely to be most urgent. 
25 The present invention addresses this objective by providing a method and 

system for assigning importance classes to electronic messages. The term 
"electronic messages" relates to e-mail messages, facsimile messages, or to text 
data of converted voice messages or pager messages. In the context of the present 
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invention, the term "importance class" relates to the degree of relevance to a certain 
recipient of a communication, assigned by a system using the method to each of a 
group, consisting at least one elranent, of electronic messages. The term "assigning 
importance classes" relates to associating each of a group, consisting at least one 

5 element, of electronic messages, with an importance class attribute, for example, by 
means of embedding, tagging or any other acceptable linking method. 

The method conq)rises identifying the sender of an electronic message, 
identifying the recipient of liie electronic message, determining a relative 
organizational distance between the sender and the recipient, and assigning the 

10 electronic message an importance class as a function of the relative organizational 
distance between the sender and the recipient 

In the context of the present invention, the term "relative organizational 
distance" relates to a metric derived from an organi2ation hierarchical structure. 
Specifically, in a preferred embodiment the relative organizational distance is a 

15 function of the level of work aflSliation between the corresponding departments of 
the message sender and the message recipient, and of the relative hierarcbical level 
of said sender and receiver. In a fiirther embodiment, this function is refined 
according to one or more of the following: (a) a set of global control rules 
according to the organizational structure and Ihe work affiliation among different 

20 departments and different hierarchical layers in the corporation; (b) a set of control 
rules according to ad hoc work groups formed firom time to time; (c) a global list of 
preferred origiaatuig addresses, external to the organization, &om senders afSliated 
with the organization. 

For exanq)le, a message fix>m an individual in the same department as the 

25 recipient is often attributed a higher importance than a message arriving from a 
different department, or a message firom the same position level or firom a direct 
supervisor or firom a person directly reporting to the recipient, is attributed a higher 
importance class than a message from a sender positioned much higher or further 
down in the hierarchy. A message from an essential unit for daily operation (e.g. a 
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message from a research unit to a development imit) is attributed a higher 
importance than a message from non-essential unit (e.g. legal or administrative). 
The organizational rules may also be for a specific period of time, for cxamplo, a 
project involving two development teams A and B collaborating for a specific 
5 period of time conveys higher importance on communication between the two. 

In a fiirfher embodiment, the coaqputation of the iiiq)ortance class attribute is 
a weighted average of the relative organizational distance, and at least one 
additional criterion, selected &om the following: (a) a globaUy defined content 
criterion; (b) a personally defined message sender criterion; (c) a personally 
10 defined content criterion; (d) a plurality of rules formed by a machine-learning 
algorithm or algorithms; (e) an analysis of e-mail message headers. 

The terai "globally defined content criterion" relates to a pre-defined set of 
key words, temis and phrases, constituting references of "important" and 
"unimportant" content items found m the text of the message body and/or Ihe 
15 message subject field. As the word "globaUy" indicates, this references set appUes 
for all users' messages. Similarly, the term "personally defined content criterion" 
relates to such references set of "important" and "unimportant" content items 
defined individually by a user, which applies only for messages arriving to that 
user's inbox. 

20 The term "machine-learning algorilhms" relates generally to coniputational 

models and techniques for automatic improvement of performance, based on past 
experience. In the context of the present invention, such algorithms are used for 
tracing users' actions vapon receipt of a message, for example, opening, replying, 
forwarding or deleting. Crossing said actions with other data related to the 

25 respective message, for exan^le, sender's identity, content items, etc. and 
comparing it with past information gathered, aUows derivation of new classification 
rules accordingly, based on the assumption that a user's behavior consistently 
observed through time indicates of the importance ascribed by tiie recipient to 
received messages. 
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The term "analysis of e-mail message headers" relates to detection, 
inteipretatioii and processing of a feature or features in message headers, 
specifically, message headers such as "to", "cc" and "bcc" fields. An example for 
such a feature or features is the number of the recipients in the "to", "cc" or "bcc" 
5 field. A user appearance as the sole recipient of a message may imply higher 
importance or relevance of said message to the recipient 

The present invention fiirflier relates to a method for streamlining the 
management of electronic messages, the method comprising: (a) assigning an 
10 importance class to each of said messages; and (b) streamlining said messages in a 
pre-determined manner in accordance with the respective importance class of each 
message. 

In a preferred embodiment, the streamlining includes displaying 
notifications of incoming messages either in a color that is characteristic of the 

15 respective importance class of each message, or with a distinctive tag that is 
charactaistic of the respective importance class of each message, or sorted in a pre- 
determined order, for exanqjle, in descending order, indicating tiieir relative 
iinportance in respect with their assigned importance classes. In a furfher 
embodiment, the streamlining includes blocking messages whose hnportance class 

20 is beneath a predetermined threshold, either with or without alerting the sender that 
a message has been blocked. 

It is to be appreciated that the invention uses the importance level parameter 
to enable multiple, different operations, for example, the user is able to differentiate 
the most important messages and focus his/her attention to those messages 

25 exclusively, Further examples are the transmission of the most important messages 
only to the e-mail client (referred to herewith as "selective synchronization"), or the 
archiving of less important ones (refOTred to herewith as "selective archiving"), thus 
the overall e-mail processing efficiency is increased. 

In the scope of the present invention is also a system for assigning 
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importance classes to electronic messages, said system comprising: 

(a) a message data extraction unit for identifying a sender and a recipient of 
an electronic message. The message data extraction unit is a series of computer 
instructions adapted to capture an identification of a message sender and the 

5 message recipient; and 

(b) a classifier coTq)led to the message data extraction unit and being 
responsive to a relative organizational distance between the sender and the recipient 
for assigning an importance class to the electronic message. The classifier is a 
module of computer program capable of associating identities of message senders 

10 and message recipients with pre-determined organizational data, for determining 
flie relative organizational distance between respective senders and recipients. The 
classifier is fiirfher capable of calculating an importance class attribute of a 
message, according to a relative organizational distance between the sender and the 
recipient. 

15 In a further embodiment, the classifier is further adapted to assigning said 

importance class based on at least one additional criterion, selected firom the 
following: 

(a) a pre-defined message sender criterion; 

(b) a pre-defined content criterion; 

20 (c) a plurality of rales formed by a machine-learning algorithm tracing 

user actions; 

(d) an analysis of e-mail message headers. 

According to a preferred enabodiment, the calculation of the importance 
class attribute by tiie classifier further involves a rules formation unit cornprising: 
25 (a) a set of global control lul^ relating to an organizational structure and 

work afaiiation among different departments and different hierarchical layers 
thereof; 

(b) a set of control rules relating to ad hoc work groups formed from 
time to time in said organizational stmcture; and 
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(c) a global list of preferred originating addresses external to the 
organizational structure- 

The set of rules in the rules formation unit may be either pre-determined (i.e. 
an immutable data object supplied by the system provider) or a dynamic product of 
5 the s ystem ( i.e. ana daptive, c onfigurable, r ule g enerating m odule, r esponsive t o 
user behavior, user configuration, administrator configuration, message content and 
features analysis, or other system variables). 

In a further embodiment the rales schema encapsulated in the rules 
formation unit includes three categories, as wiU be explained iti detail hereinafter: 
10 (a) organizational rales, (b) content dependent rules, and (c) user-behavior based 
rules. 

Organizational rales make use of the infra-structure of the organization to 
determine the importance of the message, as wiU be further explained and 
illustrated in detail hereinafter. 

15 In one embodiment, the system stores the departmental and hierarchical 

structure of the organization in a database, along with a definition of the work 
affiliation level between the various departments. The organizational structure 
information can be input to the system's database using available industry tools, or 
through a Graphical User Interface (GUI) specific to a system according to the 

20 invention. 

Alternatively, the system may use a "skeleton" hierarchical structure which 
is based on permanent relationship between \mits. 

The system enables to define ad hoc groups of users who, for a specific 
period of time, may require firequent and high priority communication level 
25 regardless of their regular position in the organization. 

In addition to the organizational data, refrared to as "global organizational 
data", which is set by the organization, each user is allowed to define individual 
preferences, referred to as "personal organizational data". A person, although not 
bound by the organization, may increase, for his/her own reasons, the importance of 
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messages received jfrom a specific unit 

The content rules in the second rules category determine the in:5)ortance 

level attribute according to the degree of correlation between the message's content 

and reference sets of "important" and ''unimportant content items. 
5 Most reference content items are global, although the system allows each 

user a limited ability to define personal reference content items, as long as they do 

not conflict with the global definition. 

The third rules categoiy employs a machiae-learning algorithm that updates 

and refines the said predefined organizational and content rules according to the 
10 actions taken by each user on previously received messages. The user's actions are 

recorded by a "user-behavior agent". Those actions may be interpreted as indicative 

of the subjective importance assigned to the message. 

Each rule category's contribution to the classification process may vary, 

depOTding on weighting factors and on the confidence of each category's decision, 
15 which can be either pre-determined or dynamically assigned, for example, the 

weighting coefBcient of the content category could be increased or decreased 

automatically according to the corresponding confidence in assigmng the 

importance class. In accordance with a preferred embodiment, the organizational 

category (both global and personal) has the highest contribution, i.e. in the range of 
20 60% to 70% of the total contribution sum. The behavior and content categories 

contribute the rest. 

BRIEF DESCRIPTION OF THE DRAWINGS 

In order to understand the invention and to see how it may be carried out in 
practice, some prefrared embodiments will now be described, by way of non- 
25 limiting example only, with reference to the accompanying drawings, in which: 

Fig, 1 is a diagram illustrating one embodiment of a system for classifying 
incomiag messages; 

Fig. 2 is a diagram illustrating m detail a classification system as featured in 
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Fig. 1; 

Kg. 3 i s a diagram illustrating the integration o f a system as depicted in 
detail in Fig. 1 and 2 in an overall environment adapted to the streanalining of 
electronic communication management; 
5 Fig. 4 illustrates in detail the e-mail server interface depicted in Fig. 3; 

Fig. 5 illustrates in detail the coirfiguration management and monitoring 
module depicted in Fig. 3; 

Fig. 6 illustrates in detail the e-mail chent intraface depicted in Fig. 3; 

Fig. 7 illustrates in detail the usct Amotions module depicted in Fig, 3; 
10 Fig. 8 is a diagram illustrating the integration of the environment depicted in 

Fig. 3 in an enterprise network; and 

Fig. 9 depicts an example of the flow of execution from a message arrival to 
a user's inbox until the notification thereof is presented to the user with the 
appropriate ioiportance class indication. 

15 

DETAILED DESCRIPTION OF THE INVENTION 

Fig. 1 illustrates a system (400) according to the invention, also referred to 
herewith as "classification engine", for classifyiiig incoming messages. The 
classification engine (400) includes an extract message data unit (470) for gleaning 

20 the information required to establishment of the message's importance class, by 
means of applying, within the classifier (450), a set of classification rules foimd in 
the rules formation imit (480). The rules formation xmit (480) is able to maintain a 
repository of pre-determined classification rules, as well as generating new ones, 
based upon the information obtained from the extract message data unit (470), and 

25 from the organizational structure data, both being transferred to rules formation unit 
(480), as shown in the drawing. Once the message's importance class is established 
by the classifier (450), it is then assigned to the message by the classifier (450). 
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Fig. 2 describes in detail the classification engine (400) of Fig. 1. The 
classification engine (400) includes a rules generator (410) which receives 
organizational data from the org chart application (30), possibly through drivers 
(80) or other intermediating software, and configuration parameters fi-om the 
5 system administrator and from the users. The user behavior agent (420) adaptively 
derives rules based on individual message handling. 

The rules are stored in the rules database (430). The classifier (450) uses the 
rales from the database, along with the message features supphed by a feature 
extractor (440) and content items suppUed by a text parser and analyzer (460), to 
10 determine the importance of each message. The rules generator (410), the user 
behavior agent (420) and the rales database (430) together constitute the rules 
formation unit (480) shown in Fig. 1. The feature extractor (440) together by the 
text parser and analyze (460) constitute the extract message data unit (470) shown 
in Fig. 1. 

15 

Fig. 3 illustrates the integration of a classification engine (400) as depicted 
in detail in Fig. 1 and 2 in an overall environment (10), also referred to herewith as 
a "system", adapted to the streamlining of electronic commimication UMnagement 
In the s ystem ( 10) the classification engine (400) is coupled to a user functions 

20 module (500) which encapsulates the implementation of user related fimctions, 
such as the selective synchronization and the selective archiving fimctions. The 
classification engine (400) and the user fimctions module (500) are both coupled to 
an e-mail server interface (200) which serves for the transfer of e-mail messages as 
well as control and status messages between the e-mail server and the components 

25 in the environment (10) interacting with it The user fimctions module (500) is also 
coupled with a configuration, management and monitoring module (100) which 
enables the system administrator to manage and configure the operation of the 
system (10) as well as to monitor its status and performance. The configuration, 
management and monitoring module (100) also allows for each individual user to 
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modify the classification rules encapsxilated in the classification engine (400), as 

well as selected modes of operation, for example, with the system (10) enabled or 

disabled, according to one's personal preferences. In fiirfher embodiments the 

configuration, management and monitoring module ( 100) comprises an interface 
5 for accessing configuration, management or monitoring fimctions over standard 

TCP/IP communication transport channels, such as HTTP (Hyper-Text Traiisfer 

Protocol), using a standard web browser application (70). 

The Org Chart application (30) encapsulates the organization chart, which is 

extracted from it through the drivers (80). In fiirfher embodiments the Org Chart 
10 application (30) includes tools for building a database describing the organizational 

structure, and for updating said database's records according to permanent or 

temporary changes in this structure. 

It is to be appreciated that direct interaction between the client computer and 

the system components is reduced to a minimum necessary. 

15 

Fig. 4 depicts in detail the e-mail server interface (200) shown in Fig. 3. The 
functionality of the e-mail server interface (200) comprises monitoring message 
traffic inside the e-mail server, interception of incoming and outgoing messages, 
transferring those messages or copies thereof to other modules featured in the 

20 environment (10) of Fig. 3, such as the classification engine (400) and the user 
functions (500), and sending the messages back to the e-mail server after or during 
their processing procedure by the different units of system (10). 

The trafSc monitoring unit (220) detects messages arriving to users' inboxes 
hosted on the e-mail server, and activates the Message Intercept and Return module 

25 (210), which posts them in the message buffer (230) for further processing by the 
classification engine (400). The message buffer (230) dequeues importance-tagged 
messages after being processed by the classification engine (400), and places them 
in the recipient's mailbox with the proper importance attribute. 



11 



wo 2004/030296 '^^^ ^fccT/IL2003/000760 



Fig. 5 illustrates in detail tiie cOTd&guration, management and monitoring 
module (100) depicted in Fig. 3. The configuiation, management and monitoring 
module (100) allows for system administrators to configure a plurality of 
parameters of the system (10) in Fig. 3 for optimal performance, and to adapt said 

5 system for specific needs of a certain organization. In addition the module (100) 
enables the administrator to monitor the performance of Uie system (10) and to 
perform fault tracking. E-mail users can also utilize the module (100) for personal 
preferences setting. The module (100) includes a configuration database (140) for 
maintaining the administrator's and the users' settings, and a performance database 

10 (130), for maintaining the performance data, such as messages distribution, 
malfunctions and exceptions, usage monitoring and other reports, as well as a Ust of 
logging parameters. The module (100) further supplies graphical tools for said 
configuration, management and monitoring purposes, in the means of an 
administrator GUI (110) and a user GUI (120), as will be explained and illustrated 

15 in detail hereinafter. 

Fig. 6 describes in detail the e-mail client interfece (300) depicted in Fig. 3. 
The e-mail client interface (300) includes an agent controller (320) that controls a 
behavior agent (40) sitting in the e-mail client (50) which captures flie user actions 
20 on received messages. The behavior agent (40) can be remotely installable and 
executable. The agent controller (320) receives user action information in addition 
to a designator to the corresponding message, and stores them in the user's actions 
database (310) for use by the classification engine (400). 

25 Fig. 7 describes in detail the user functions module (500) depicted in Fig. 3. 

The user functions module (500) includes a selective synchronization module (520) 
which allows e-mail clients to limit tiieir inbox exclusively to e-mails with a 
predefined importance level, and a selective archiving module (510) which allows 
archiving messages residing in the user's inbox, whose importance class is beneath 
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a predetermined threshold. The selective archiving module (510) also enables 
convenient, easy search and retrieval of previously archived messages. 

Specifically, the selective synchronization module (520) allows a selective 
traasmission of e-mail messages from the e-mail server's inbox to the cUent 
5 computer's inhox:, according to a predefined in[5)ortance level. 

The selective mailbox synchronization procedure in a system according to a 
preferred embodiment of the present invention is as follows: 

(a) The traffic monitoring unit (220) in the e-mail server interface 
module (200) receives a "request for synch" event from the e-mail server 
10 (b) The s elective s ynchronization m odule ( 520) c hecks t he i mportance 

tag of each message before its transfer to the chent computer, and approves or 
prevents the transfer according to the corresponding importance tag. 



Each e-mail user sets the following parameter that affects the functionality 
15 of the selective synchronization module (520): 

Synchronization_importance (messages of this importance level and ahove 
is transferred to the cUent) 



The selective archiving module (510) allows "smart archiving" of previously 
20 received messages, by means of packing groups of e-mails residing in a user's 
inbox into archives according to their importance class, and the time elapsed since 
they were received. 

The Smart Archiving function for each user is activated based on the 
preferences selected by the user: 
25 Enable/disable archiving 

Ma?cjiumber_of_mails in inbox (start archiving after nxraiber is exceeded) 
Max_inboxjtnemory_size (start archiving after size is exceeded) 
Archive_importance (archive messages at this importance level or lower) 
Archive_age_resolution (archive messages older than this value) 
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Destination folder names 

Archived Messages are stored in compressed fomaat in order to save storage 

space. 

Fig. 8 illustrates the integration of the system (10) in an enterprise network, 
5 according to one embodiment of the present invention. The enterprise network is a 
model of a working environment in which different computer programs and 
applications are mnning on a distant machine external to a user's computer and are 
accessed by the user through a web browsor. The main server (710) runs the major 
components of the system ( 10), such as the classification engine (400), the user 
10 functions m odule ( 500), and o tiier r elated functions. The e -mail s erver ( 730), a s 
well as the management console (720) and the users* conq)uters (740)-(760) are all 
connected to the main server (710) through the enterprise network as shown in Ihe 
drawing, thus enabling access to the configuration, management and monitoring 
module (100) of the system (10), using the graphical tools supplied by the module 
15 . (100), such as the administrator GUI (110) on the management console (720), and 
the user GUI (120) on the users' conqjuters (740)-(760). 

Fig. 9 depicts an example of the flow of execution from arrival of a message 
to a user's inbox tmtil the notification thereof is presented to the user with the 

20 appropriate importance class indication. In step 800, the traflac monitoring unit 
(220) in the e-mail server interface (200) gets an event on a new message being 
stored in a users' inbox, i.e. an incoming message. In step 810, the traffic 
monitoring unit (220) determines whether the respective message related with said 
event is, indeed, new. If the message is new, in step 820 the message intercept and 

25 return unit (210) then copies the message and stores it in the message buffer (230), 
firom which the classification engine (400) dequeues the message in step 830, sets 
its itrrportance tag in step 840 and restores it in the message buffer (230). Finally, 
the message is returned to the mail server (overwriting a previous importance field, 
if any, or the entire message). In a preferred embodiment, the message is further 
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used, in step 850, for adaptive algorithm traiaing, as will be ejcplained in detail 
hereinafter, evaitually resulting, in step 860, in the update of classification rules 
maintained in the rules database (430) accordingly, as shown in the drawing. 

It is to be appreciated that the method according to the present invention is 

5 being implemented on a copy of the message that is external to a central repository 
on which incoming messages are stored so as to enable uninternipted service in the 
case that said method fails to operate or malfunctions, and fiirfhermore to avoid loss 
of messages or messages parts. 

In a preferred embodiment, the messages are stored by the e-mail server in 

10 the user's inbox concurrently with being processed by the system (10). After an 
importance class of a message has been determined the respective importance 
attribute of the message is updated accordingly. 

A message stored in the message buffer (230) for classification is processed 
15 first by the feature extractor (440) and by the text parser and analyzer (460). 

The text parser and analyzer (460) extracts content items from the message 
subject field and firom the message body and passes them to the classifier (450) and 
to the user behavior analyzer (420). Content items are a list of key words, terms and 
phrases found in the text The extraction of content items can be performed using 
20 known techniques such as m Schwei^ofer and Winiwarter, **Refining the 
selectivity of thesauri by means of statistical analysis", in Intl. Congress on 
Terminology and Knowledge Engineering, 1993. 

The feature extractor (440) extracts the following features from each 
25 message and passes them to the classifier (450) and to the user behavior analyzer 
(420): 

Sender's e-mail address and/or nickname 
Recipient's e-mail address and/or nickname 
Number of recipients in the "to" field 
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Number of recipients in the "cxj" or ^Tjcc" field 

The classifier (450) applies rules from all categories in order to determine 
the message importance level. The rules are drawn fi:om the rules database (430). 
5 The following categories of classification rules are applied, as will be explained in 
detail hereinafter: 

(a) organizational 

(b) content 

(c) behavioral (adaptive) 

10 

The final importance level is a result of the weighted average of the outputs 
of all rule categories. The weighting factors are a fimction of predetermined values 
as well as the corresponding confidence level The classifier contains a collection of 
reference "important" content items and reference **unimportanf * content items, for 
15 comparison with content items found in the messages. 

The importance class indication (the importance tag) is attached to each 
message. If the importance class is ^'unknown", no tag is attached to that message. 

20 The implementation of the importance tag enables a clear display of the 

importance class, by means of different colors or any other clearly visible tag, and 
allows "importance based" sorting in the e-mail chent. 

Four Importance classes are defined: high, medium, low, unknown. 
25 Additional classes can be added upon need. 

The final importance class is determined as a weighted average of the 
outputs of all rules categories. The output of each rale category is assigned a 
confidence level. If none of the conditions of a certain rule is fulfilled then the 
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rule's output is: iinportaiic^=imknown. The final irnportaDce class is calculated 
according to the following formula: 



Fiualjunportance = INT 



YW(i)^IC(i) 



5 



Where: 

5 IC(i) - importance class determined by rule category i, where IC = 1 for low 

importance, 2 for medium importance, 3 for high importance, 0 for unknown 
importance. 

W(i) - weighting factor of rule category i, where 0<W<1 . 
10 The weighting factor for each rule category is calculated as follows: 



W = W_const*CF 



Where: 

15 Wjconst - the constant weighting factor for that category as configured by 

the administrator. 

CF = confidence factor for that category (CF assumes values between zero 
and one, where a zero designates no confidence and a one designates fidl 
confidence). 

20 

All . weighting coefficients, decision thresholds and other constants are 
configurable through the administrator GUI (110). A limited number of parameters 
are configurable through the user GUI (120). 

25 The "organizational" rules category may include mles as in the following 

list: 

If <organizational distance=low> then < iraportance=high> 

If <organizational distanc^=medium> then < importance=mediimi> 
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If <orga3Mza1io]ial distance=Mgh> then < rcaportaiice=low> 

If <sender is executive> then <importance=high> 

If <sender & recipient belong to workgroup> then <importance=high> 

If <extemal address & belongs to preferred list> then <importance=high> 

5 

The organizational distance is calculated as follows: 
Org_dis = dep_dis + her_dis 

Where: dep_dis is the departmental distance and her_dis is the hierarchal 
10 distance. 

Dep_dis can assume 3 values - 0,1,2 (where 0 corresponds to the same 
department). 

Her_dis can assume the values - 0,1,2,3,.... (where 0 corresponds to the 
same hierarchal layer, 1 corresponds to +/- 1 level difference etc.). 

15 

Organizational distance is defined as follows: 
If (Orudis <= Tl) then (organizational distance = low) 
If (org_dis > Tl and <= T2) then (organizational distance = medium) 
Else (organizational distance = high) 
20 Default values: Tl=l, T2=2 

Each user may dejBne a set of personal preferences relating to senders 
internal or external to the organization. Such user's personal settings can only 
increase the importance set by tiie administrator. For example, personal preferences 
25 may be of the following types: 

(a) If <sender belongs to preferred_intemal_address> then 
<importance==high> 

(b) If <sender belongs to preferred_extemal__address> then 

<iniportance=high> 
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The system allows setting importance classes according to a global list of 
preferred originating addresses, external to the organization, from senders affiliated 
with the organi2ation, such as customers, suppliers, partners, etc. The hst of 
5 preferred external addresses is defined by the system administrator or is drawn from 
the data bases of existing enteiprise plications such as enterprise resource 
planning ^RP) or customer resource management (CRM). 

The system allows the setting of importance classes according to the SMTP 
10 (Sitqple Mail Transfer Protocol) message headers such as "to", "cc" and "hcc" 
fields. Following are some examples for such rules: 

If <recipient alone in "to" header> then <importanc^high> 
Else If <^ of recipients in ''to" header less then Nl> then 
<importanc^=medium> 
15 If <recipient alone in "cc" header> then <importance=medium> 

If <recipient alone in "bcc" header> then <importance^high> 
Else <importanc^=unknown> 

The importance class set by the organizational category is according to the 

20 following rule: 

If at least one mle voted "high" then IC=high 
Else if at least one rule voted "medium" then IC=^edium 
Else if at least one rule voted "low" then lOlow 
Else IO=unknown 

25 

The confidence level of the organizational rule category is calculated as 
follows: 

If one rule voted for the selected importance class then CF = CF_min 

If two rules voted for the selected importance class then CF = CF_medium 
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If three or more rules voted for the selected importance class then CF = 
CF_jnax 

Defaxdt values: CF_nmi=0.7, CF_mediiim=0.85, CF_max=l 

5 The function of the content based rules is to assist in classifying e-mail 

messages on two levels: 

(a) Higher rating of relevant, work-related content 

(b) Lower rating of irrelevant content such as jokes, music files, video 
files, solicitation etc. 

10 There are numerous prior art methods for classifying text messages 

according to their content (one such method is described in US patent 6,519,580). 
One simple method is a simple "search and counf operation of given reference 
keyword. The present invention is not limited to any specific text classification 
method. 

15 

The following content rules are applied: 

If <message_subject_content_item belongs to irrelevant_class> then 
<unportance=low> (multiple irrelevant classes are supported) ELSE If 
<message_subject__content_Jtem belongs to relevant_class> then 
20 <importance^high> (multiple irrelevant classes are supported). 

If <message_body_content_iteni belongs to irrelevant_class> then 
<importance=low> (multiple irrelevant classes are supported). ELSE If 
<message_Jbody_content_item belongs to relevant_class> then <importance=high> 
25 (multiple irrelevant classes are supported). 

Most "reference content-classes" are defined by the system administrator. A 
limited number of "personal reference content classes" can be defined by each user. 
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provided that they do not conflict with administrator defined classes. For example, 
if a user specifies a key word as "unimportanf ' and the same keyword was already 
defined by the administrator as "important", the system rejects the personal setting. 

5 Most prior art methods for classifying text messages also generate a 

confidence indication liiat can be used for calculation of the final importance class. 
For a single key word "search and count" method the following simple algorithm 
can be used to estimate the confidence factor of the content rule category: 

If # of matching key words is larger tihen N_min but smaller then 
10 N_medium then CF = CF_min 

If # of matching key words is larger then Njtnedium but smaller then 
Njtnax then CF == CFjnedium 

If # of matching key words is larger then N_max tiien CF = CF_max 

I 

15 Default N values: N_nmi=2, N_medium=4, N_max=6 

Default CF values: CF_min=0.7, CF_medium=0.85, CF_max=l 

Ad^tive rules are generated based on the user's actions taken on previously 
received e-mail messages. The behavioral information is recorded for aU e-mail- 

20 receiving users as defined by the system administirator. The behavioral information 
for each user is recorded on a statistically sufficient number of messages (several 
hundreds). The behavioral information is used to deduce the message importance. 
This information is used to produce rules that relate various message attributes to 
the iriq>ortance level of messages. Those rules are used in Ihe process of classifying 

25 new messages. The classification outcome of the adaptive algorithm is 
accompanied by a corresponding confidence factor estimation that is used for the 
calculation of the final importance class as described above. 



The following '*user actions" taken on received e-mail messages are 
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recorded by the *^lse^ behavior agent', causing minimal interference with the client 
computer: 

(a) Time elapsed from the moment the message was stored in the inbox 

till it was opened by the user 
5 (b) Time elapsed from the moment user started e-mail activity (after the 

message was stored in the inbox) till it was opened 

(c) Time during which the message remained open 

(d) Replying to the message 

(e) Forwarding the message 

10 (f) FiliQg or saving the message 

(g) Deleting the message 

(h) Printing the message 

The following importance criteria are appUed: 
15 If <time_to_open=short> then <importance=high> 

If <time_to_open_since_activity_started=short> then <importance=high> 
If <time_message_opened=long or time_attachment_open=long> then 

<importanceNhigh> 

If <message replied> then <in:q>ortanceF=high> 
20 If <message forwarded and filed/saved> then <importance=high?> 

If <message printed> then <nnportance=high> 
If <message forwarded> then <importance=medium> 
If <message filed/save€t> then <importance=medium> 
If <message deleted> then <importanc^low> 

25 

The final importance class is determined as a weighted average of the above 
criteria, accordiag to the following formula: 

IC W= INT ^ +0-5 
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The following attributes are extracted and rated: 

(a) Message sender 

(b) Organizational distance between sender and recipient 
5 (c) "Subjecf ' field content 

(d) Message body content 

(e) Recipient alone in **to" header field 

(f) Recipient alone in "cc" or 'Tjcc'' header field 

(g) Or a combination of two or naore of the above 

10 

The adaptive, behavioral based, rules are generated using, for example, prior 
art techniques such as: 

(a) Naive Bayes 

(b) Rule learning or machine learning Algorithms 
15 (c) Support Vector Machine 

Adaptively generated rules have a limited, configurable, validity time 
(training is performed over the last N days). The algorithm is appUed continuously 
in order to adjust for dynamic conditions. 

20 The system allows the administrator to define the organizational structure 

using a Graphical User Interface (GUI), or to import it from an existing enterprise 
database. After defining/importing the org chart, the administrator is able to 
subscribe employees to the invention's services. This is done by clicking the 
"subscribe users" button, and then selecting from the org chart the following 

25 Options: 

(a) The entire company 

(b) Whole departments 

(c) Whole hierarchal layers 

(d) Individual users 
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For all subscribed employees, the system searcbes the coiporate database 
and retrieve their personal details (e-mail address, nickname). For names not found, 
the system prompts the administrator to manually enter the corresponding data. 

5 The system provides a graphical tool for convenient definition of the org_dis 

parameter. The tool is appUed to the standard graphical view of the org chart. 

Dep_dis between departments is defined by marking a distance between two 
departments by cUcking tbem one after another. The GUI prompts the user to 
10 choose the distance value, for both directions (message sent firom one department to 
the second and for the opposite direction). A definition of distance between two 
departments also applies to all their sub departments. 

Specifying a distance between sub departments (sub department may consist 
also an individual in a department or a group of individuals in a department) 
15 overrides the distance defined between the parent departments. 

- A distance is marked between two individuals by selecting the two. An 
individual distance definition overrides the distance defined between the 
departments and/or the sub departments (for said individuals only). 

20 The system allows the selection of a group of individuals that belong to a 

workgroup (with or without a time limit). For all individuals who belong to the 
workgroup - org^dis = low. The system alerts the administrator T before expiration 
of the workgroup vaHdity period (T= 1 week). For workgroup definition the system 
treats department managers as individuals and not as representatives of their 

25 departments. 

The add/remove users screen allows the administrator to add or remove 
employees' subscription to the invention's services after the initial system 
installation. After selecting the add/remove button the administrator is presented 
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with the oig chart screen, where he is able to add/remove/change the user ia the org 
chart, and then subscribe/unsubscribe him in the procedure described above. 

Data on the performance of the invention is logged in the database for 
5 history recording, offline performance analysis, perfonnance improvements and 
user behavior profiling. The system allows for easy application of various statistical 
analysis operator (average, standard deviation, histograms, correlation etc.) and 
graphical presentation of the results. A partial list of parameters for logging may 
include: 

10 (a) Disiribution of messages according to their importance level (per 

corporate, department or individual user). 

(b) Correlation between rules categories and specific rules results, and 

the fibaal importance setting. 

(c) Correlation between user behavior and the final importance setting 
15 (per corporate, department or individual user). 

(d) Monitoring of changes made by users to their personal preferences. 

(e) The behavior of the above data over time (daily, weekly, monthly and 
yearly resolution). 

It will also be understood that the system according to the invention may be 
20 a suitably programmed computer. Likewise, the invention contemplates a con^uter 

program being readable by a computer for executing the method of the invention. 

The invention further contenq>lates a machine-readable memory tangibly 

embodying a program of instractions executable by the machine for executing the 

method of the invention. 
25 In the method claims that follow, alphabetic characters and Roman numerals 

used to designate claun steps are provided for convenience only and do not imply 

any particular order of performing the steps. 
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CLAIMS: 

1. A method of assigning importance classes to electronic messages, the 
method comprising: 

(a) identifying the sender of an electronic message; 
5 (b) identifying the recipient of the electronic message; 

(c) determining a relative oiganizational distance betwe^ the sender and the 
recipient; and 

(d) assigning the electronic message an importance class as a function of the 
relative organizational distance between the sender and the recipient. 

10 2. The method according to claim 1 , wherein said function is further weighted 
by at least one additional criterion, selected from the following: 

(a) a globally dejBned content criterion; 

(b) a personally defined message sender criterion; 

(c) a personally defined contmt criterion; 

15 (d) a plurality of mles formed by a machine-learning algorithm or algorithms. 

(e) an analysis of e-mail message headers. 

3. The method according to claim 1, wherein said electronic message is an 
electronic mail (e-mail) message. 

4. The method according to claim 1, wherein said electronic message is a 
20 facsimile message. 

5. The method according to claim 1, wherein said electronic message is a 
converted voice message or pager message text data. 

6. The method according to claim 1, wherein (c) is determined from an 
organizational structure of a corporation and said function is refined according to 

25 one or more of the following: 

(a) a set of global control mles according to the organizational structure and 
the work afiSliation among different departments and different hierarchical layers in 
the corporation; 
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(b) a set of control rules according to ad hoc work groups formed from time to 
time; 

(c) a global list of preferred originating addresses, external to the oiganization, 
from senders affiliated with the organization. 

5 7. A method for streanalining the management of electronic messages, the 
method comprising 

(a) assigning an importance class to each of said messages according to the 
method of claim 1; aad 

(b) • streamlining said messages in a pre-detennined manner in accordance with 
10 the respective importance class of each message. 

8» The method for streamlining the management of electronic messages 
according to claim 7, wherein streamlining the messages includes displaying 
notifications of incoming messages in a color that is characteristic of the respective 
importance class of each message. 
15 9. The method for streamlining the management of electronic messages 
according to claim 7, wherein streamlining the messages includes displaying in 
association with notifications of incoming messages a distinctive tag that is 
characteristic of the respective importance class of each message. 

10. The method for strea33ilining the management of electronic messages 
20 according to claim 7, wherein streamlining the messages includes sorting 

notifications of incoming messages in a pre-detennined order, indicating the 
relative importance of said messages in respect with their assigned importance 
classes. 

11. The method for streamlining the management of electronic messages 
25 according to claim 7, wherein streamlining the messages includes blocking 

messages whose importance class is beneath a predetermined threshold. 

12. The method according to Claim 11, further including alerting the sender 
that a message has been blocked. 

13. The method according to claim 2, wherein the at least one additional 



27 



wo 2004/030296 

CT/IL2003/000760 



criterion is a function of content in the noiessage subject field and/or in the message 
body. 

14. The method according to claim 2, wherein (d) includes analyzing actions 
taken by said recipient on receipt of said messages so as to establish a relative 

5 importance ascribed by the recipient to received messages. 

15. The method according to any one of the preceding claims being 
implemented on a copy of the message that is external to a central repository on 
which incoming messages are stored so as to enable unintemq)ted service in the 
case that said method fails to operate or malfimctions. 

10 16. The method according to any one of the precedmg claims including 
selectively transmitting e-mail messages from an e-mail server's inbox to a client 
computer's inbox, according to said importance class. 

17. The method according to any one of the preceding claims, further including 
grouping messages residing in a user's inbox into archives, according to their 

15 importance class and an elapsed time since they were received. 

18. The method according to claim 1, wherein (c) includes using a graphical 
tool to define the organizational distance between different entities within the 
organization. 

19. A system for assigning inq)ortance classes to electronic messages, said 

20 system comprising: 

a message data extraction unit for identifying a sender and a recipient of an 

electronic message; and 

a classifier coupled to the message data extraction unit and being responsive to a 
relative organizational distance between the sender and the recipient for assigning 
25 an importance class to liie electronic message. 

20. The system according to claim 19, wherein the classifier is further adapted 
to assigning said importance class based on at least one additional CTiterion, 
selected from the following: 

(a) a pre-defined message sender criterion; 
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(b) a pre-defined content criterion; 

(c) a plurality of rules formed by a machine-learning algorithm tracing user 

actions; 

(d) an analysis of e-mail message headers. 

5 21. The system according to claim 19 or 20, further including a rules formation 
unit comprising: 

(a) a set of global control rules relating to an organizational structure and work 
affiliation among different departments and different hierarchical layers thereof; 

(b) a set of control rules relating to ad hoc work groups formed from time to 
10 time in said organizational structure; and 

(c) a global list of preferred originating addresses external to the organizational 
stracture. 

22. A program storage device readable by machine, tangibly embo<fying a 
program of instmctions executable by the machine to perform method steps of 

15 assigning importance classes to electronic messages, the method comprising: 

(a) identifying the sender of an electronic message; 

(b) identifying the recipient of the electronic message; 

(c) detemiining a relative organizational distance between the sender and the 
recipient; and 

20 (d) assigning the electronic message an importance class, according to the 
relative organizational distance between the sender and the recipient 

23. A computer program product comprising a computer useable medium 
having computer readable program code embodied therein of assigning importance 
classes to electronic messages, the compat&i program product comprising: 

25 computer readable program code for causing the computer to identify the sender of 
an electronic message; 

computer readable program code for causing the computer to identify the recipient 
of the electronic message; 

computer readable program code for causing the computer to determine a relative 
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organizational distance between the sender and the recipient; and 
computer readable program code for causing the conq)uter to assign flie electronic 
message an importance class, according to the relative organizational distance 
between the sender and the recipient. 
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